Assignment of Endogenous Retrovirus Integration Sites Using a Mixture Model

Authors

  • DAVID R. HUNTER
  • LE BAO
Abstract

Structural variation occurs in the genomes of individuals because of the different positions occupied by repetitive genome elements like endogenous retroviruses, or ERVs. The presence or absence of ERVs can be determined by identifying the junction with the host genome using high-throughput sequencing technology and a clustering algorithm. The resulting data give the number of sequence reads assigned to each ERV-host junction sequence for each sampled individual. Variability in the number of reads from an individual integration site makes it difficult to determine whether a site is present when read counts are low. We present a novel two-component mixture of negative binomial distributions to model these counts and assign a probability that a given ERV is present in a given individual. We explain how our approach is superior to existing alternatives, including another form of two-component mixture model and the much more common approach of selecting a threshold count for declaring the presence of an ERV. We apply our method to a data set of ERV integrations in mule deer (Odocoileus hemionus), a species for which no genomic resources are available, and demonstrate that the discovered patterns of shared integration sites contain information about animal relatedness.
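The idea of assigning a presence probability from a read count can be illustrated with a minimal sketch. It assumes a generic two-component negative binomial mixture in which one component stands for "ERV absent" (background reads) and the other for "ERV present"; it is not the authors' exact model or fitting procedure. The function names, the mean/dispersion parameterization, and all parameter values are hypothetical choices made for illustration, and in practice the parameters would be estimated from data (for example by an EM algorithm).

```python
import math


def nb_logpmf(k, mu, size):
    """Log-probability of count k under a negative binomial
    with mean `mu` and dispersion parameter `size` (both > 0)."""
    return (
        math.lgamma(k + size) - math.lgamma(k + 1) - math.lgamma(size)
        + size * math.log(size / (size + mu))
        + k * math.log(mu / (size + mu))
    )


def prob_present(k, pi_present, mu_absent, size_absent, mu_present, size_present):
    """Posterior probability that a site is present given read count k,
    under a two-component negative binomial mixture (Bayes' rule)."""
    log_absent = math.log(1.0 - pi_present) + nb_logpmf(k, mu_absent, size_absent)
    log_present = math.log(pi_present) + nb_logpmf(k, mu_present, size_present)
    # Subtract the maximum before exponentiating to avoid underflow.
    m = max(log_absent, log_present)
    w_absent = math.exp(log_absent - m)
    w_present = math.exp(log_present - m)
    return w_present / (w_absent + w_present)


# Hypothetical parameter values, chosen only for illustration.
PARAMS = dict(pi_present=0.3, mu_absent=1.0, size_absent=1.0,
              mu_present=50.0, size_present=2.0)

if __name__ == "__main__":
    for count in (0, 2, 5, 10, 40):
        print(count, round(prob_present(count, **PARAMS), 4))
```

Unlike a fixed threshold count, this kind of calculation returns a graded probability for each individual and site, so intermediate read counts are reported as uncertain instead of being forced into a present/absent call.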


Similar resources

Computational and Statistical Analyses of Insertional Polymorphic Endogenous Retroviruses in a Non-Model Organism

Endogenous retroviruses (ERVs) are a class of transposable elements found in all vertebrate genomes that contribute substantially to genomic functional and structural diversity. A host species acquires an ERV when an exogenous retrovirus infects a germ cell of an individual and becomes part of the genome inherited by viable progeny. ERVs that colonized ancest...


Chromosomal localization of three endogenous retrovirus loci associated with virus production in White Leghorn chickens.

Proposed mechanisms for the generation of endogenous retrovirus loci have been examined by determining the chromosomal distribution of these loci by means of in situ hybridization. Unlike the clustering on chromosome 1 of five endogenous retrovirus loci associated with the gs- chf- phenotype (A. Tereba and S. M. Astrin, submitted for publication), three loci associated with endogenous retrovirus...


Estimation of human endogenous retrovirus activities from expressed sequence databases

Human endogenous retroviruses (HERVs) are remnants of ancient retrovirus infections and now reside within the human DNA. Recently HERV expression has been detected in both normal tissues and diseased patients. However, the activities (expression levels) of individual HERV sequences are mostly unknown. In this work we introduce a generative mixture model, based on Hidden Markov Models, for estim...


Integration target site selection by a resurrected human endogenous retrovirus.

At least 8% of the human genome was formed by integration of retroviral DNA sequences. Here we analyze the forces directing the accumulation of human endogenous retroviruses (HERVs) by comparing de novo HERV integration targeting with the distribution of fixed HERV elements in the human genome. All known genomic HERVs are inactive due to mutation, but we were able to study integration targeting...


Intact EAV-HP endogenous retrovirus in Sonnerat's jungle fowl.

The EAV-HP group of chicken endogenous retrovirus elements was previously shown to be defective, with large deletions of the pol gene. In this report, we demonstrate that genomes of other Gallus species also maintain EAV-HP elements with similar deletions. The chicken EAV-HP1 locus was detected in both red (Gallus gallus gallus) and Sonnerat's (Gallus sonneratii) jungle fowl with identical inte...




Publication date: 2017